FURTHER APPLICATIONS OF A POWER SERIES METHOD FOR PATTERN AVOIDANCE

Abstract. In combinatorics on words, a word w over an alphabet $\Sigma$ is said to avoid a
pattern p over an alphabet $\Delta$ if there is no factor x of w and no non-erasing morphism h
from $\Delta^*$ to $\Sigma^*$ such that $h(p) = x$. Bell and Goh have recently applied an
algebraic technique due to Golod to show that for a certain wide class of patterns p there are
exponentially many words of length n over a 4-letter alphabet that avoid p. We consider some
further consequences of their work. In particular, we show that any pattern with k variables of
length at least $4^k$ is avoidable on the binary alphabet. This improves an earlier bound due
to Cassaigne and Roth.



1. Introduction 

In combinatorics on words, the notion of an avoidable/unavoidable pattern was first
introduced (independently) by Bean, Ehrenfeucht, and McNulty [1] and Zimin [22]. Let $\Sigma$
and $\Delta$ be alphabets: the alphabet $\Delta$ is the pattern alphabet and its elements are
variables. A pattern p is a non-empty word over $\Delta$. A word w over $\Sigma$ is an instance
of p if there exists a non-erasing morphism $h : \Delta^* \to \Sigma^*$ such that $h(p) = w$.
A pattern p is avoidable if there exist infinitely many words x over a finite alphabet such
that no factor of x is an instance of p. Otherwise, p is unavoidable. If p is avoided by
infinitely many words on an m-letter alphabet then it is said to be m-avoidable. The survey
chapter in Lothaire [12, Chapter 3] gives a good overview of the main results concerning
avoidable patterns.
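
The definitions above can be made concrete with a small brute-force checker. The following is
a minimal sketch and is not taken from the paper: the names `is_instance` and `avoids` are
illustrative, patterns and words are represented as Python strings with one character per
variable or letter, and the search is exponential in the worst case, so it is only meant for
short inputs.

```python
def is_instance(pattern, word):
    """Return True if word = h(pattern) for some non-erasing morphism h."""

    def extend(pi, wi, images):
        # pi: position in the pattern, wi: position in the word,
        # images: partial morphism, mapping variables to non-empty words.
        if pi == len(pattern):
            return wi == len(word)
        var = pattern[pi]
        if var in images:
            img = images[var]
            return word.startswith(img, wi) and extend(pi + 1, wi + len(img), images)
        # Every remaining pattern symbol needs at least one letter of the word.
        remaining = len(pattern) - pi - 1
        for end in range(wi + 1, len(word) - remaining + 1):
            images[var] = word[wi:end]
            found = extend(pi + 1, end, images)
            del images[var]
            if found:
                return True
        return False

    return extend(0, 0, {})


def avoids(pattern, word):
    """Return True if no non-empty factor of word is an instance of pattern."""
    n = len(word)
    return not any(
        is_instance(pattern, word[i:j]) for i in range(n) for j in range(i + 1, n + 1)
    )
```

For example, `is_instance("xyx", "abcab")` holds with x mapped to `ab` and y mapped to `c`,
while `avoids("xx", "abcbc")` fails because of the factor `bcbc`.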

The classical results of Thue [19, 20] established that the pattern xx is 3-avoidable and
the pattern xxx is 2-avoidable. Schmidt [17] (see also [2]) proved that any binary pattern
of length at least 13 is 2-avoidable; Roth [15] showed that the bound of 13 can be replaced
by 6. Cassaigne [7] and Vaníček [21] (see [10]) determined exactly the set of binary patterns
that are 2-avoidable.

Bean, Ehrenfeucht, and McNulty [1] and Zimin [22] characterized the avoidable patterns
in general. Let us call a pattern p for which all variables occurring in p occur at least twice
a doubled pattern. A consequence of the characterization of the avoidable patterns is that
any doubled pattern is avoidable. Bell and Goh [3] proved the much stronger result that
every doubled pattern is 4-avoidable. Cassaigne and Roth (see [8] or [12, Chapter 3]) proved
that any pattern containing k distinct variables and having length greater than
$200 \cdot 5^k$ is 2-avoidable. In this note we apply the arguments of Bell and Goh to show
the following result, which improves that of Cassaigne and Roth.

Theorem 1. Let k be a positive integer and let p be a pattern containing k distinct variables.

(a) If p has length at least $2^k$ then p is 4-avoidable.

(b) If p has length at least $3^k$ then p is 3-avoidable.

(c) If p has length at least $4^k$ then p is 2-avoidable.



2. A power series approach



Rather than simply wishing to show the avoidability of a pattern p, one may wish instead 
to determine the number of words of length n over an m-letter alphabet that avoid p (see, 
for instance, Berstel's survey [1]). Brinkhuis [6] and Brandenburg [5] showed that there are 
exponentially many words of length n over a 3-letter alphabet that avoid the pattern xx. 
Similarly, Brandenburg showed that there are exponentially many words of length n over a 
2-letter alphabet that avoid the pattern xxx. 
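
To get a feel for the counting problem, the numbers of short words avoiding xx or xxx can be
enumerated directly. The sketch below is only an illustration under our own naming
(`count_avoiding_power` is hypothetical, not from the cited works): it extends words one
letter at a time and discards a word as soon as it ends in a forbidden power, which suffices
because every proper prefix has already been checked.

```python
def count_avoiding_power(alphabet, exponent, max_len):
    """counts[n] = number of words of length n with no factor of the form u^exponent."""
    counts = [0] * (max_len + 1)

    def ends_with_power(word):
        n = len(word)
        for period in range(1, n // exponent + 1):
            block = word[n - period:]
            # Compare the last `exponent` blocks of length `period`.
            if all(
                word[n - (r + 1) * period: n - r * period] == block
                for r in range(exponent)
            ):
                return True
        return False

    def extend(word):
        counts[len(word)] += 1
        if len(word) == max_len:
            return
        for letter in alphabet:
            word.append(letter)
            if not ends_with_power(word):
                extend(word)
            word.pop()

    extend([])
    return counts


print(count_avoiding_power("abc", 2, 5))  # square-free ternary words
print(count_avoiding_power("ab", 3, 4))   # cube-free binary words
```

Such a direct enumeration only reaches small lengths; the results cited above concern the
growth rate for all n.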

As previously mentioned, Bell and Goh proved that every doubled pattern is 4-avoidable. 
In fact, they proved the stronger result that there are exponentially many words of length 
n over a 4-letter alphabet that avoid a given doubled pattern. Their main tool in obtaining 
this result is the following. 

Theorem 2 (Golod). Let S be a set of words over an m-letter alphabet, each word of length
at least 2. Suppose that for each $i \ge 2$, the set S contains at most $c_i$ words of length
i. If the power series expansion of

\[ G(x) := \Bigl( 1 - mx + \sum_{i \ge 2} c_i x^i \Bigr)^{-1} \]

has non-negative coefficients, then there are at least $[x^n]G(x)$ words of length n over an
m-letter alphabet that avoid S.

Theorem 2 was originally presented by Golod (see Rowen [16, Lemma 6.2.7]) in an algebraic
setting. We have restated it here using combinatorial terminology. The proof given in
Rowen's book is also phrased in algebraic terminology; in order to make the technique
perhaps a little more accessible to combinatorialists, we present a proof of Theorem 2 using
combinatorial language.

Proof of Theorem 2. For two power series $f(x) = \sum_{i \ge 0} a_i x^i$ and
$g(x) = \sum_{i \ge 0} b_i x^i$, we write $f \ge g$ to mean that $a_i \ge b_i$ for all
$i \ge 0$. Let $F(x) := \sum_{i \ge 0} a_i x^i$, where $a_i$ is the number of words of length
i over an m-letter alphabet that avoid S. Let $G(x) = \sum_{i \ge 0} b_i x^i$ be the
power series expansion of G defined above. We wish to show $F \ge G$.

For $k \ge 1$, there are $m^k - a_k$ words w of length k over an m-letter alphabet that
contain a word in S as a factor. On the other hand, for any such w either (a) $w = w'a$, where
a is a single letter and w' is a word of length $k - 1$ containing a word in S as a factor; or
(b) $w = xy$, where x is a word of length $k - j$ that avoids S and $y \in S$ is a word of
length j. There are at most $(m^{k-1} - a_{k-1})m$ words w of the form (a), and there are at
most $\sum_j a_{k-j} c_j$ words w of the form (b). We thus have the inequality

\[ m^k - a_k \le (m^{k-1} - a_{k-1})m + \sum_j a_{k-j} c_j. \tag{1} \]

Rearranging, we have

\[ a_k - a_{k-1} m + \sum_j a_{k-j} c_j \ge 0, \tag{2} \]

for $k \ge 1$.

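Consider the function

\[ H(x) := F(x) \Bigl( 1 - mx + \sum_{j \ge 2} c_j x^j \Bigr)
        = \Bigl( \sum_{i \ge 0} a_i x^i \Bigr) \Bigl( 1 - mx + \sum_{j \ge 2} c_j x^j \Bigr). \]

Observe that for $k \ge 1$, we have $[x^k]H(x) = a_k - a_{k-1} m + \sum_j a_{k-j} c_j$. By (2),
we have $[x^k]H(x) \ge 0$ for $k \ge 1$. Since $[x^0]H(x) = 1$, the inequality $H \ge 1$ holds,
and in particular, $H - 1$ has non-negative coefficients. Since G has non-negative
coefficients by hypothesis, we conclude that $F = HG = (H - 1)G + G \ge G$, as required. □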
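
As a sanity check of the statement just proved, the bound can be compared with a brute-force
count on a toy example. The sketch below is our own illustration (the function names and the
choice of example are assumptions, not part of the paper): it takes m = 4 and
S = {aa, bb, cc, dd}, so that $c_2 = 4$ and $G(x) = (1 - 4x + 4x^2)^{-1}$, and computes the
coefficients of G from the recurrence $b_n = m b_{n-1} - \sum_i c_i b_{n-i}$ obtained by
comparing coefficients in $G(x)(1 - mx + \sum_i c_i x^i) = 1$.

```python
from itertools import product


def golod_coefficients(m, c, n_max):
    """Coefficients b_0..b_{n_max} of G(x) = 1 / (1 - m x + sum_i c[i] x^i)."""
    b = [1]
    for n in range(1, n_max + 1):
        correction = sum(ci * b[n - i] for i, ci in c.items() if i <= n)
        b.append(m * b[n - 1] - correction)
    return b


def count_avoiding(alphabet, forbidden, n):
    """Brute-force count of words of length n with no factor in `forbidden`."""
    total = 0
    for letters in product(alphabet, repeat=n):
        word = "".join(letters)
        if not any(f in word for f in forbidden):
            total += 1
    return total


lower = golod_coefficients(4, {2: 4}, 6)
exact = [count_avoiding("abcd", ["aa", "bb", "cc", "dd"], n) for n in range(7)]
for n, (g, f) in enumerate(zip(lower, exact)):
    print(n, g, f)  # lower bound from G, then exact count
```

In this example the lower bound is $(n+1)2^n$ while the exact count is $4 \cdot 3^{n-1}$ for
$n \ge 1$, so the bound is valid but not tight.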

Theorem 2 bears a certain resemblance to the Goulden-Jackson cluster method [11, Section
2.8], which also produces a formula similar to (1). The cluster method yields an exact
enumeration of the words avoiding the set S but requires S to be finite. By contrast,
Theorem 2 only gives a lower bound on the number of words avoiding S, but now the set S can
be infinite.

Theorem 2 can be viewed as a non-constructive method to show the avoidability of
patterns over an alphabet of a certain size. In this sense it is somewhat reminiscent of the
probabilistic approach to pattern avoidance using the Lovász local lemma (see [2], [9]). For
pattern avoidance it may even be more powerful than the local lemma in certain respects.
For instance, Pegden [13] proved that doubled patterns are 22-avoidable using the local
lemma, whereas Bell and Goh were able to show 4-avoidability using Theorem 2. Similarly,
the reader may find it a pleasant exercise to show using Theorem 2 that there are
infinitely many words avoiding xx over a 7-letter alphabet; as far as we are aware, the
smallest alphabet size for which the avoidability of xx has been shown using the local lemma
is 13.
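
The exercise on xx over 7 letters can be explored numerically. The following is a sketch
under stated assumptions rather than a solution: we take S to be the set of all squares, so
that there are exactly $m^i$ words of length $2i$ in S and none of odd length, and we use a
hypothetical helper `square_bound_coefficients` to list the first coefficients of the
resulting series G. Checking finitely many coefficients is of course not a proof that all of
them are non-negative; the growth ratio 5 tested below is merely a candidate for an inductive
argument.

```python
def square_bound_coefficients(m, n_max):
    """First coefficients of G(x) = 1 / (1 - m x + sum_{i>=1} m^i x^(2i))."""
    b = [1]
    for n in range(1, n_max + 1):
        # Contribution of squares of length 2i, of which there are m^i.
        penalty = sum(m**i * b[n - 2 * i] for i in range(1, n // 2 + 1))
        b.append(m * b[n - 1] - penalty)
    return b


seven = square_bound_coefficients(7, 60)
print(seven[:6])
print(all(seven[n] >= 5 * seven[n - 1] for n in range(1, 61)))

# Over a 2-letter alphabet the hypothesis of Theorem 2 fails:
print(square_bound_coefficients(2, 6))
```

The binary case produces a negative coefficient, which is consistent with the fact that xx is
not 2-avoidable.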

3. Proof of Theorem 1



To prove Theorem 1 we begin with some lemmas.



Lemma 3. Let $k \ge 1$ and $m \ge 2$ be integers. If w is a word of length at least $m^k$
over a k-letter alphabet, then w contains a non-empty factor w' such that the number of
occurrences of each letter in w' is a multiple of m.

Proof. Suppose w is over the alphabet $\Sigma = \{1, 2, \ldots, k\}$. Define the map
$\psi : \Sigma^* \to \mathbb{N}^k$ that maps a word x to the k-tuple
$[\,|x|_1 \bmod m, \ldots, |x|_k \bmod m\,]$, where $|x|_a$ denotes the number of occurrences
of the letter a in x. For each prefix $w_i$ of length i of w, let $v_i = \psi(w_i)$. Since w
has length at least $m^k$, w has at least $m^k + 1$ prefixes, but there are at most $m^k$
distinct tuples $v_i$. There exist therefore $i < j$ such that $v_i = v_j$. However, if w' is
the suffix of $w_j$ of length $j - i$, then $\psi(w') = v_j - v_i = [0, \ldots, 0]$ (with the
subtraction taken componentwise modulo m), and hence the number of occurrences of each letter
in w' is a multiple of m. □
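
The pigeonhole argument in this proof is effective and translates directly into a short
search over prefixes. The sketch below is our own illustration (the name `balanced_factor`
is hypothetical): it records the tuple of letter counts modulo m for each prefix and stops at
the first repeated tuple.

```python
from itertools import product


def balanced_factor(word, alphabet, m):
    """Return (i, j), i < j, such that every letter count in word[i:j] is
    divisible by m, or None if no two prefixes share a count vector mod m."""
    index = {a: t for t, a in enumerate(alphabet)}
    counts = [0] * len(alphabet)
    seen = {tuple(counts): 0}  # count vector mod m -> prefix length
    for j, letter in enumerate(word, start=1):
        counts[index[letter]] = (counts[index[letter]] + 1) % m
        key = tuple(counts)
        if key in seen:
            return seen[key], j
        seen[key] = j
    return None


# Every word of length m^k over k letters contains such a factor (here k = 2, m = 3).
for letters in product("ab", repeat=9):
    w = "".join(letters)
    i, j = balanced_factor(w, "ab", 3)
    assert all(w[i:j].count(a) % 3 == 0 for a in "ab")

print(balanced_factor("abab", "ab", 2))  # a word of length 2^2
print(balanced_factor("aba", "ab", 2))   # a word of length 2^2 - 1
```

The last call returns None: the word aba has length $2^2 - 1$ and no non-empty factor in
which both letter counts are even, so the length requirement cannot be relaxed in this case.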



Lemma 4 ([3]). Let $k \ge 1$ be an integer and let p be a pattern over the pattern alphabet
$\{x_1, \ldots, x_k\}$. Suppose that for $1 \le i \le k$, the variable $x_i$ occurs
$a_i \ge 1$ times in p. Let $m \ge 2$ be an integer and let $\Sigma$ be an m-letter alphabet.
Then for $n \ge 1$, the number of words of length n over $\Sigma$ that are instances of the
pattern p is at most $[x^n]C(x)$, where

\[ C(x) := \sum_{i_1 \ge 1} \cdots \sum_{i_k \ge 1}
           m^{i_1 + \cdots + i_k} x^{a_1 i_1 + \cdots + a_k i_k}. \]
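
The coefficients of C(x) are easy to tabulate, since the multiple sum factors as a product of
k single-variable series $\sum_{i \ge 1} m^i x^{a_j i}$, one per variable. The following
sketch uses that observation; the function name `instance_bound` and the list representation
of truncated series are our own assumptions.

```python
def instance_bound(occurrences, m, n_max):
    """Coefficients [x^0], ..., [x^n_max] of C(x) for a pattern whose variables
    occur occurrences[0], occurrences[1], ... times, over an m-letter alphabet."""
    coeffs = [1] + [0] * n_max  # start from the constant series 1
    for a in occurrences:
        nxt = [0] * (n_max + 1)
        for deg, c in enumerate(coeffs):
            if c == 0:
                continue
            i = 1
            # Multiply by sum_{i >= 1} m^i x^(a i), truncated at degree n_max.
            while deg + a * i <= n_max:
                nxt[deg + a * i] += c * m**i
                i += 1
        coeffs = nxt
    return coeffs


print(instance_bound([2], 2, 6))     # pattern xx over a binary alphabet
print(instance_bound([2, 2], 2, 6))  # two variables, each occurring twice
```

For the pattern xx the bound is attained: there are exactly $2^i$ binary squares of length
$2i$.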

For the proof of the next lemma, we essentially follow the approach of Bell and Goh. 

Lemma 5. Let $k \ge 2$ be an integer and let p be a pattern over a k-letter pattern alphabet
such that every variable occurring in p occurs at least $\mu$ times.

(a) If $\mu = 3$, then for $n \ge 0$, there are at least $2.94^n$ words of length n avoiding
p over a 3-letter alphabet.

(b) If $\mu = 4$, then for $n \ge 0$, there are at least $1.94^n$ words of length n avoiding
p over a 2-letter alphabet.

Proof. Let $(m, \mu) \in \{(3, 3), (2, 4)\}$ and let $\Sigma$ be an m-letter alphabet. Define
S to be the set of all words over $\Sigma$ that are instances of the pattern p. By Lemma 4,
the number of words of length n in S is at most $[x^n]C(x)$, where

\[ C(x) := \sum_{i_1 \ge 1} \cdots \sum_{i_k \ge 1}
           m^{i_1 + \cdots + i_k} x^{a_1 i_1 + \cdots + a_k i_k}, \]

and for $1 \le i \le k$ we have $a_i \ge \mu$. Define

\[ B(x) := \sum_{i \ge 0} b_i x^i = (1 - mx + C(x))^{-1}, \]

and set $\lambda := m - 0.06$. We claim that $b_n \ge \lambda b_{n-1}$ for all $n \ge 1$. This
suffices to prove the lemma, as we would then have $b_n \ge \lambda^n$ and the result follows
by an application of Theorem 2.

We prove the claim by induction on n. When $n = 1$, we have $b_0 = 1$ and $b_1 = m$.
Since $m > \lambda$, the inequality $b_1 \ge \lambda b_0$ holds, as required. Suppose that for
all $1 \le j < n$, we have $b_j \ge \lambda b_{j-1}$. Since $B = (1 - mx + C)^{-1}$, we have
$B(1 - mx + C) = 1$. Hence $[x^n]B(1 - mx + C) = 0$ for $n \ge 1$. However,

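\[ B(1 - mx + C) = \Bigl( \sum_{i \ge 0} b_i x^i \Bigr)
   \Bigl( 1 - mx + \sum_{i_1 \ge 1} \cdots \sum_{i_k \ge 1}
   m^{i_1 + \cdots + i_k} x^{a_1 i_1 + \cdots + a_k i_k} \Bigr), \]

so

\[ [x^n]B(1 - mx + C) = b_n - b_{n-1} m + \sum_{i_1 \ge 1} \cdots \sum_{i_k \ge 1}
   m^{i_1 + \cdots + i_k} b_{n - (a_1 i_1 + \cdots + a_k i_k)} = 0. \]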

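Rearranging, we obtain

\[ b_n = \lambda b_{n-1} + (m - \lambda) b_{n-1} - \sum_{i_1 \ge 1} \cdots \sum_{i_k \ge 1}
   m^{i_1 + \cdots + i_k} b_{n - (a_1 i_1 + \cdots + a_k i_k)}. \]

To show $b_n \ge \lambda b_{n-1}$ it therefore suffices to show

\[ (m - \lambda) b_{n-1} - \sum_{i_1 \ge 1} \cdots \sum_{i_k \ge 1}
   m^{i_1 + \cdots + i_k} b_{n - (a_1 i_1 + \cdots + a_k i_k)} \ge 0. \tag{3} \]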



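Since $b_j \ge \lambda b_{j-1}$ for all $j < n$, we have
$b_{n-i} \le b_{n-1}/\lambda^{i-1}$ for $1 \le i \le n$. Hence

\[
\begin{aligned}
\sum_{i_1 \ge 1} \cdots \sum_{i_k \ge 1}
  m^{i_1 + \cdots + i_k} b_{n - (a_1 i_1 + \cdots + a_k i_k)}
&\le \sum_{i_1 \ge 1} \cdots \sum_{i_k \ge 1}
  m^{i_1 + \cdots + i_k} \frac{\lambda b_{n-1}}{\lambda^{a_1 i_1 + \cdots + a_k i_k}} \\
&= \lambda b_{n-1} \sum_{i_1 \ge 1} \cdots \sum_{i_k \ge 1}
  \frac{m^{i_1}}{\lambda^{a_1 i_1}} \cdots \frac{m^{i_k}}{\lambda^{a_k i_k}} \\
&= \lambda b_{n-1} \Bigl( \sum_{i_1 \ge 1} \frac{m^{i_1}}{\lambda^{a_1 i_1}} \Bigr) \cdots
  \Bigl( \sum_{i_k \ge 1} \frac{m^{i_k}}{\lambda^{a_k i_k}} \Bigr) \\
&\le \lambda b_{n-1} \Bigl( \sum_{i \ge 1} \frac{m^i}{\lambda^{\mu i}} \Bigr)^k \\
&= \lambda b_{n-1} \Bigl( \frac{m/\lambda^\mu}{1 - m/\lambda^\mu} \Bigr)^k \\
&= \lambda b_{n-1} \Bigl( \frac{m}{\lambda^\mu - m} \Bigr)^k.
\end{aligned}
\]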

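In order to show that (3) holds, it thus suffices to show that

\[ m - \lambda \ge \lambda \Bigl( \frac{m}{\lambda^\mu - m} \Bigr)^k. \]

Recall that $m - \lambda = 0.06$ and that $k \ge 2$. In both cases $m/(\lambda^\mu - m) < 1$,
so the right-hand side is largest when $k = 2$. For $(m, \mu) = (3, 3)$ we have

\[ 2.94 \Bigl( \frac{3}{2.94^3 - 3} \Bigr)^2 = 0.052677\cdots < 0.06, \]

and for $(m, \mu) = (2, 4)$ we have

\[ 1.94 \Bigl( \frac{2}{1.94^4 - 2} \Bigr)^2 = 0.052439\cdots < 0.06, \]

as required. This completes the proof of the inductive claim and the proof of the lemma. □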
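
The two numerical constants, and the growth claim itself, can be checked with exact
arithmetic. The sketch below is our own illustration: `final_bound` evaluates
$\lambda (m/(\lambda^\mu - m))^k$ with k = 2 and $\lambda = m - 0.06$, and `b_coefficients`
computes the series B(x) for the particular case of two variables, each occurring exactly
$\mu$ times (an assumed example, chosen because it is the extreme case allowed by the lemma).

```python
from fractions import Fraction


def final_bound(m, mu, k=2, gap=Fraction(6, 100)):
    """Exact value of lam * (m / (lam^mu - m))^k with lam = m - gap."""
    lam = m - gap
    return lam * (Fraction(m) / (lam**mu - m)) ** k


def b_coefficients(m, mu, n_max):
    """Coefficients of B(x) = 1 / (1 - m x + C(x)) for k = 2 and a_1 = a_2 = mu."""
    c = [0] * (n_max + 1)
    for i1 in range(1, n_max // mu + 1):
        for i2 in range(1, n_max // mu + 1):
            deg = mu * (i1 + i2)
            if deg <= n_max:
                c[deg] += m ** (i1 + i2)
    b = [1]
    for n in range(1, n_max + 1):
        b.append(m * b[n - 1] - sum(c[s] * b[n - s] for s in range(1, n + 1)))
    return b


for m, mu in [(3, 3), (2, 4)]:
    print(m, mu, float(final_bound(m, mu)))
    b = b_coefficients(m, mu, 60)
    # b_n >= (m - 0.06) b_{n-1}, written with integers to avoid rounding.
    print(all(100 * b[n] >= (100 * m - 6) * b[n - 1] for n in range(1, 61)))
```

Both printed values lie below 0.06, in agreement with the displayed computations, and the
ratio test passes for every coefficient computed.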

We can now complete the proof of Theorem 1. Let p be a pattern with k variables. If p
has length at least $2^k$, then by Lemma 3, the pattern p contains a non-empty factor p' such
that each variable occurring in p' occurs at least twice. However, Bell and Goh showed that
such a p' is 4-avoidable and hence p is 4-avoidable.

Similarly, if p has length at least $3^k$ (resp. $4^k$), then by Lemma 3, the pattern p
contains a non-empty factor p' such that each variable occurring in p' occurs at least 3 times
(resp. 4 times). If p' contains only one distinct variable, recall that we have already noted
in the introduction that the pattern xxx is 2-avoidable (and hence also 3-avoidable). If p'
contains at least two distinct variables, then by Lemma 5, the pattern p' is 3-avoidable
(resp. 2-avoidable), and hence the pattern p is 3-avoidable (resp. 2-avoidable). This
completes the proof of Theorem 1.

Recall that Cassaigne and Roth showed that any pattern p over k variables of length
greater than $200 \cdot 5^k$ is 2-avoidable. Their proof is constructive but is rather
difficult. We are able to obtain the much better bound of $4^k$ non-constructively by a
somewhat simpler argument. Cassaigne suggests (see the open problem [12, Problem 3.3.2]) that
the bound of $3^k$ in Theorem 1(b) can perhaps be replaced by $2^k$ and that the bound of
$4^k$ in Theorem 1(c) can perhaps be replaced by $3 \cdot 2^k$. Note that the bound of $2^k$
in Theorem 1(a) is optimal, since the Zimin pattern on k variables (see [12, Chapter 3]) has
length $2^k - 1$ and is unavoidable.
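
The Zimin patterns are easy to generate. The sketch below assumes the standard recursive
definition $Z_1 = x_1$, $Z_k = Z_{k-1} x_k Z_{k-1}$, which is not restated in this note, and
represents the variable $x_i$ by the integer i; the function names are our own. It also
checks that no non-empty factor of $Z_k$ contains every one of its letters an even number of
times, so these patterns have no doubled factor of the kind produced by Lemma 3.

```python
def zimin(k):
    """Zimin pattern on k variables, as a list of integers 1..k."""
    word = []
    for var in range(1, k + 1):
        word = word + [var] + word
    return word


def has_even_factor(word):
    """True if some non-empty factor contains each of its letters an even
    number of times."""
    n = len(word)
    for i in range(n):
        odd = set()  # letters occurring an odd number of times in word[i:j+1]
        for j in range(i, n):
            odd ^= {word[j]}
            if not odd:
                return True
    return False


for k in range(1, 7):
    z = zimin(k)
    print(k, len(z), has_even_factor(z))
```

Each line reports length $2^k - 1$ and no such factor, which matches the remark above that
the bound $2^k$ cannot be lowered.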